In this paper we apply a new string-theory-based approach to the real financial market. The
strings are defined here by their boundary conditions, characteristic length, real values and the method
of redistribution of information. The string map represents the detrending and data standardization
procedure. We use 1-end-point and 2-end-point open strings and partially compactified strings that
satisfy the Dirichlet and Neumann boundary conditions. We establish two different models to
predict the behavior of the forex market. The first model is based on the correlation function as an
invariant, and the second one is an application based on the deviations from the closed string/pattern
form. The two approaches perform differently: the first model cannot predict the behavior of the
forex market with good efficiency, whereas the second one can and, in addition, is able to make a
relevant profit per year.
I. INTRODUCTION



In this paper we apply the string model and approaches described in [1] to the real forex market. This is an
extension of the previous work [1] to the real financial market. We derive two models for predicting EUR/USD
prices on the forex market. This is the first attempt at a real application of string theory in the field of finance,
and not only in high energy physics, where it is very well established. First, we briefly describe some connections
between these different fields of research.

We would like to transfer modern physical ideas into the neighboring field called econophysics. The physical
statistical viewpoint has proved to be fruitful, namely in the description of systems where many-body effects dominate.
However, the standard bottom-up approaches accepted by physicists are cumbersome, or outright impossible to apply,
when following the behavior of complex economic systems, where autonomous models encounter intrinsic variability.

The modern digital economy is founded on data. Our primary motivation comes from the actual physical concepts [2], [3];
however, our realization differs from the original attempts in various significant details. As with most scientific
problems, the representation of data is the key to efficient and effective solutions. The development of string theory over
the past 25 years has achieved a high degree of popularity among physicists [4].

The underlying link between our approach and the string theory may be seen in the switching from a local to a 
non-local form of data description. This line passes from the single price to the multivalued collection, especially the 
string of prices from the temporal neighborhood, which we term here the string map. It is the relationship between 
more intuitive geometric methods and financial data. Here we work on the concept that is based on projection data 
into higher dimensional vectors in the sense of the works @ . 

The present work exploits time series from which one can build the family of string-motivated models of
boundary-respecting maps. The purpose of the present data-driven study is to develop statistical techniques for the
analysis of these objects and, moreover, for the application of such string models to the forex market. Both of the
string prediction models in this paper are built on a physical principle.



II. DEFINITION OF THE STRINGS 



By applying standard methodologies of detrending, one may convert the original series of quotations of
the mean currency exchange rate $p(\tau)$ into a series of returns defined by

$$\frac{p(\tau + h) - p(\tau)}{p(\tau + h)}, \tag{1}$$

where $h$ denotes a tick lag between the currency quotes $p(\tau)$ and $p(\tau + h)$, and $\tau$ is the index of the quote. The mean
$p(\tau) = (p_{ask}(\tau) + p_{bid}(\tau))/2$ is calculated from $p_{ask}(\tau)$ and $p_{bid}(\tau)$.
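The return definition above can be illustrated with a minimal Python sketch. The function names `mid_price` and `tick_returns` and the four sample quotes are assumptions made for illustration only; they are not taken from the paper or its data set.

```python
def mid_price(ask, bid):
    """Mean quote p = (p_ask + p_bid) / 2 for each tick."""
    return [(a + b) / 2.0 for a, b in zip(ask, bid)]


def tick_returns(p, h):
    """Relative change (p[tau + h] - p[tau]) / p[tau + h] for each valid tau."""
    if h < 1:
        raise ValueError("the tick lag h must be a positive integer")
    return [(p[tau + h] - p[tau]) / p[tau + h] for tau in range(len(p) - h)]


# Illustrative quotes only; not taken from the data set used in the paper.
ask = [1.3502, 1.3504, 1.3503, 1.3507]
bid = [1.3500, 1.3502, 1.3501, 1.3505]
p = mid_price(ask, bid)
r = tick_returns(p, h=1)
```

Note that the return is normalized by the later quote, `p[tau + h]`, as in the definition above, rather than by the earlier one.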

In the spirit of string theory, it is better to start with the 1-end-point open string map

$$P^{(1)}(\tau, h) = \frac{p(\tau + h) - p(\tau)}{p(\tau + h)}, \qquad h \in \langle 0, l_s \rangle, \tag{2}$$

where the superscript (1) refers to the number of endpoints.

Later, we may use the notation P{p} which emphasizes functional dependence upon the currency exchange rate 
{p}. It should also be noted that the use of P highlights the canonical formal correspondence between the rate of 
return and the internal string momentum. 

Here the tick variable $h$ may be interpreted as a variable which extends along the extra dimension limited by the
string size $l_s$. A natural consequence of the transform, Eq. (2), is the fulfillment of the boundary condition

$$P^{(1)}(\tau, 0) = 0, \tag{3}$$

which holds for any tick coordinate $\tau$. Later on, we want to highlight the effects of rare events. For this purpose, we
introduce a power-law Q-deformed model



$$P_q^{(1)}(\tau, h) = 1 - \left[\frac{p(\tau)}{p(\tau + h)}\right]^{Q}, \qquad Q > 0. \tag{4}$$

The 1-end-point string has a defined origin, but it reflects the linear trend in $p(\cdot)$ at the scale $l_s$. Therefore, the
1-end-point string map $P_q^{(1)}(\cdot)$ may be understood as a Q-deformed generalization of the currency returns.
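A minimal sketch of the Q-deformed 1-end-point map follows, assuming the form $1 - [p(\tau)/p(\tau+h)]^Q$ with $\tau$ the tick index and $h = 0, \ldots, l_s$. The function name `string_1end` and the sample prices are hypothetical.

```python
def string_1end(p, tau, l_s, Q=1.0):
    """1-end-point map 1 - (p[tau] / p[tau + h])**Q for h = 0, ..., l_s."""
    if Q <= 0:
        raise ValueError("Q must be positive")
    return [1.0 - (p[tau] / p[tau + h]) ** Q for h in range(l_s + 1)]


# Illustrative prices only.
p = [1.00, 1.01, 0.99, 1.02, 1.05]
s1 = string_1end(p, tau=0, l_s=4, Q=1.0)  # plain returns
s6 = string_1end(p, tau=0, l_s=4, Q=6.0)  # deformed, larger moves amplified
```

For `Q = 1` the map reduces to the ordinary return, and the value at `h = 0` vanishes for any `Q`, which is the boundary condition at the single end point.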
Clearly, the situation with a long-term trend is partially corrected by fixing $P_q^{(2)}(\tau, h)$ at $h = l_s$. The open string with
two end points is introduced via the nonlinear map which combines information about trends of $p$ at two sequential
segments



$$P_q^{(2)}(\tau, h) = \left(1 - \left[\frac{p(\tau)}{p(\tau + h)}\right]^{Q}\right)\left(1 - \left[\frac{p(\tau + h)}{p(\tau + l_s)}\right]^{Q}\right), \qquad h \in \langle 0, l_s \rangle. \tag{5}$$

The map is suggested to include boundary conditions of Dirichlet type

$$P_q^{(2)}(\tau, 0) = P_q^{(2)}(\tau, l_s) = 0 \tag{6}$$

at all ticks $\tau$.

In particular, the sign of $P_q^{(2)}(\tau, h)$ comprises information about the behavior differences of $p(\cdot)$ at three quotes
$(\tau, \tau + h, \tau + l_s)$. $P_q^{(2)}(\tau, h) < 0$ occurs for trends of different sign, whereas $P_q^{(2)}(\tau, h) > 0$ indicates the match
of the signs.
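The sign behavior of the 2-end-point map can be checked with a short sketch. It assumes the map is the product of the two Q-deformed factors built on the segments $(\tau, \tau+h)$ and $(\tau+h, \tau+l_s)$. The name `string_2end` and both price series are hypothetical.

```python
def string_2end(p, tau, l_s, Q=1.0):
    """Product of the two Q-deformed segment factors for h = 0, ..., l_s."""
    values = []
    for h in range(l_s + 1):
        first = 1.0 - (p[tau] / p[tau + h]) ** Q          # trend on (tau, tau + h)
        second = 1.0 - (p[tau + h] / p[tau + l_s]) ** Q   # trend on (tau + h, tau + l_s)
        values.append(first * second)
    return values


# Illustrative series: a steady rise, and a rise followed by a fall.
p_up = [1.00, 1.01, 1.02, 1.03, 1.04]
p_rev = [1.00, 1.02, 1.04, 1.02, 1.00]
s_up = string_2end(p_up, tau=0, l_s=4)
s_rev = string_2end(p_rev, tau=0, l_s=4)
```

Both ends of `s_up` and `s_rev` are zero (Dirichlet conditions). The interior of `s_up` is positive because the two segments trend the same way, while `s_rev[2]` is negative because the trend reverses at the middle quote.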

Now we define partially compactified strings. In the frame of the string theory, the compactification attempts to 
ensure compatibility of the universe based on the four observable dimensions with twenty-six dimensions found in the 
theoretical model systems. From the standpoint of the problems considered here, the compactification may be viewed 
as an act of the information reduction of the original signal data, which makes the transformed signal periodic. Of 
course, it is not very favorable to close strings by the complete periodization of real input signals. Partial closure 
would be more interesting. This uses pre-mapping 



$$\tilde{p}(\tau) = \frac{1}{N_m} \sum_{m=0}^{N_m - 1} p(\tau + l_s m), \tag{7}$$

where the input of any open string (see e.g. Eq. (2), Eq. (5)) is made partially compact.
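As a rough illustration of the pre-mapping, the sketch below assumes it is the plain average of $N_m$ copies of the signal shifted by multiples of the string size $l_s$. The function name `compactify` and the two toy signals are hypothetical.

```python
def compactify(p, tau, l_s, n_m):
    """Average of p over n_m points spaced l_s ticks apart, starting at tau."""
    if n_m < 1:
        raise ValueError("n_m must be at least 1")
    return sum(p[tau + l_s * m] for m in range(n_m)) / n_m


# A signal with period l_s = 2 is left unchanged by the averaging ...
periodic = [1.0, 2.0, 1.0, 2.0, 1.0, 2.0]
same = [compactify(periodic, tau, l_s=2, n_m=3) for tau in range(2)]

# ... while a trending signal is smoothed towards its mean level.
trend = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
smoothed = [compactify(trend, tau, l_s=2, n_m=3) for tau in range(2)]
```

With `n_m = 1` the pre-mapping returns the original signal, so the degree of closure is controlled by `n_m` alone.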

Thus, data from the interval $\langle \tau, \tau + l_s (N_m - 1) \rangle$ are being pressed to occupy the "little space" $h \in \langle 0, l_s \rangle$. We
see that as $N_m$ increases, deviations of $\tilde{p}$ from the periodic signal become less pronounced. For example, one might
consider the construction of the (D + 1)-brane

$$\prod_{j=0}^{D} f_q\!\left(\frac{\tilde{p}_j(\tau + h_j) - \tilde{p}_j(\tau)}{\tilde{p}_j(\tau + h_j)}\right) \tag{8}$$

by means of the function

$$f_q(x) = \mathrm{sign}(x)\,|x|^{Q}, \qquad Q > 0, \tag{9}$$



maintained by combining (D + 1) 1-end-point strings, where partial compactification in D extra dimensions is
assumed. Of course, the construction introduces the auxiliary variables $\tilde{p}_j(\tau) = \sum_{m=0}^{N_{m,j} - 1} p(\tau + m\, l_{s,j})$. The corresponding
statistical characteristics of all the strings and the brane described above were displayed in detail in [1].

The prediction models were tested on one year of tick-by-tick data for the EUR/USD major currency pair from the ICAP
market maker. More precisely, we selected the period from October 2009 to September 2010.



III. CORRELATION FUNCTION AS INVARIANT 



The meaning of an invariant is that something does not change under a transformation, e.g. the form of some equations
when passing from one reference frame to another. We want to extend this idea to the finance market as well: to find some
invariants in the finance data and utilize them to predict the next prices. Unfortunately, this model is able to
define only a one-step prediction; see the definition below.

We suppose the invariant in the form of the correlation function

$$C(t, l_0) = \sum_{h=l_0}^{l} w_h \left(1 - \frac{p_{t-h}}{p_{t-1-h}}\right)\left(1 - \frac{p_{t-1-h}}{p_{t-2-h}}\right), \tag{10}$$

with

$$w_h = \frac{e^{-h/\lambda}}{\sum_{h'=0}^{l} e^{-h'/\lambda}}, \tag{11}$$

including dependence on the time scale parameters $l$, $l_0$ and $\lambda$. The relative weights automatically satisfy $\sum_{h=0}^{l} w_h = 1$.
From the condition of invariance at the step $(t + 1)$,

$$C(t, 0) = C(t+1, 0), \tag{12}$$

we finally obtain the prediction

$$p_{t+1} = p_t \left[1 + \frac{C(t+1, 1) - C(t, 0)}{w_0 \left(1 - \dfrac{p_t}{p_{t-1}}\right)}\right], \tag{13}$$

valid for $p_t \neq p_{t-1}$. These are general definitions for the correlation invariants.
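The one-step prediction can be sketched in Python under the following assumption: the only term of $C(t+1, 0)$ that contains the unknown price $p_{t+1}$ is the $h = 0$ term, $w_0 (1 - p_{t+1}/p_t)(1 - p_t/p_{t-1})$, and the remaining terms form $C(t+1, 1)$. Setting $C(t+1, 0) = C(t, 0)$ and solving for $p_{t+1}$ gives the expression used in `predict_next`. All function names and the sample prices are hypothetical.

```python
import math


def exp_weights(l, lam):
    """Normalised exponential weights w_h, h = 0, ..., l."""
    raw = [math.exp(-h / lam) for h in range(l + 1)]
    norm = sum(raw)
    return [x / norm for x in raw]


def corr(p, t, l0, l, w):
    """C(t, l0): weighted sum of products of two consecutive relative changes."""
    return sum(
        w[h] * (1 - p[t - h] / p[t - 1 - h]) * (1 - p[t - 1 - h] / p[t - 2 - h])
        for h in range(l0, l + 1)
    )


def predict_next(p, t, l, lam):
    """Price at t + 1 that keeps C invariant; None when p[t] == p[t - 1]."""
    w = exp_weights(l, lam)
    denom = w[0] * (1 - p[t] / p[t - 1])
    if denom == 0:
        return None
    # corr(p, t + 1, 1, ...) only touches prices up to index t.
    return p[t] * (1 + (corr(p, t + 1, 1, l, w) - corr(p, t, 0, l, w)) / denom)


# Illustrative prices only.
p = [1.300, 1.310, 1.305, 1.312, 1.308, 1.315]
pred = predict_next(p, t=5, l=2, lam=1.5)
```

Appending `pred` to the series and recomputing the correlation function at `t + 1` reproduces its value at `t`, which is a quick consistency check of the algebra.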



A. Prediction model based on the string invariants 

Now we want to take the above-mentioned ideas onto the string maps of finance data. We would like to utilize the 
power of the nonlinear string maps of finance data and establish some prediction models to predict the behavior of 
market similarly as in the works 043- We suggest the method where one string is continuously deformed into the 
other. We analyze 1-end-point and 2-end-point mixed string models. The family of invariants is written using the 
parametrization 

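$$\begin{aligned}
C(\tau, \Lambda) = {} & (1 - \eta_1)(1 - \eta_2) \sum_{h=0}^{\Lambda} W(h) \left(1 - \left[\frac{p(\tau)}{p(\tau + h)}\right]^{Q}\right)\left(1 - \left[\frac{p(\tau + h)}{p(\tau + l_s)}\right]^{Q}\right) \\
& + \eta_1 (1 - \eta_2) \sum_{h=0}^{\Lambda} W(h) \left(1 - \left[\frac{p(\tau)}{p(\tau + h)}\right]^{Q}\right) \\
& + \eta_2 \sum_{h=0}^{\Lambda} W(h) \left(1 - \left[\frac{p(\tau + h)}{p(\tau + l_s)}\right]^{Q}\right),
\end{aligned} \tag{14, 15}$$

where $\eta_1 \in (-1, 1)$ and $\eta_2 \in (-1, 1)$ are variables (which we may call homotopy parameters), $Q$ is a real-valued
parameter, and the weight $W(h)$ is chosen in the bimodal single-parameter form

$$W(h) = \begin{cases} 1 - W_0, & h \le l_s/2, \\ W_0, & h > l_s/2. \end{cases} \tag{16}$$

We plan to express $p(\tau + l_s)$ in terms of the auxiliary variables

$$A_1(\Lambda) = (1 - \eta_1)(1 - \eta_2) \sum_{h=0}^{\Lambda} W(h) \left(1 - \left[\frac{p(\tau)}{p(\tau + h)}\right]^{Q}\right), \tag{17}$$

$$A_2(\Lambda) = -(1 - \eta_1)(1 - \eta_2) \sum_{h=0}^{\Lambda} W(h) \left(1 - \left[\frac{p(\tau)}{p(\tau + h)}\right]^{Q}\right) p^{Q}(\tau + h), \tag{18}$$

$$A_3(\Lambda) = \eta_1 (1 - \eta_2) \sum_{h=0}^{\Lambda} W(h) \left(1 - \left[\frac{p(\tau)}{p(\tau + h)}\right]^{Q}\right), \tag{19}$$

$$A_4(\Lambda) = \eta_2 \sum_{h=0}^{\Lambda} W(h), \tag{20}$$

$$A_5(\Lambda) = -\eta_2 \sum_{h=0}^{\Lambda} W(h)\, p^{Q}(\tau + h). \tag{21}$$

Thus the expected prediction form reads

$$p(\tau_0 + l_{pr}) = \left[\frac{A_2(\Lambda) + A_5(\Lambda)}{C(\tau - l_{pr}, \Lambda) - A_1(\Lambda) - A_3(\Lambda) - A_4(\Lambda)}\right]^{1/Q}, \tag{22}$$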
FIG. 1. The profit of the model on the EUR/USD currency rate, with transaction costs included, as a function of the number of
trades for a one-year period.

where we use the notation $\tau = \tau_0 + l_{pr} - l_s$. The derivation is based on the invariance

$$C(\tau, l_s - l_{pr}) = C(\tau - l_{pr}, l_s - l_{pr}), \qquad \Lambda = l_s - l_{pr}, \tag{23}$$

where $l_{pr}$ denotes the prediction scale.
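The prediction step can be sketched as follows. The sketch assumes that the invariant is the $\eta$-weighted mixture of the 1-end-point and 2-end-point maps summed over $h = 0, \ldots, \Lambda$ with the bimodal weight $W(h)$, and that the only unknown is the future price $p(\tau + l_s) = p(\tau_0 + l_{pr})$. Splitting every term into a part without that price and a part proportional to $1/p^Q(\tau + l_s)$ gives five partial sums `a1` to `a5`. The function names, parameter values and prices are hypothetical.

```python
def weight(h, l_s, w0):
    """Bimodal weight: 1 - w0 on the first half of the string, w0 on the second."""
    return 1.0 - w0 if h <= l_s / 2 else w0


def invariant(p, tau, lam, l_s, eta1, eta2, Q, w0):
    """C(tau, Lambda); requires the price p[tau + l_s] to be known."""
    total = 0.0
    for h in range(lam + 1):
        a = 1.0 - (p[tau] / p[tau + h]) ** Q
        b = 1.0 - (p[tau + h] / p[tau + l_s]) ** Q
        total += weight(h, l_s, w0) * (
            (1 - eta1) * (1 - eta2) * a * b + eta1 * (1 - eta2) * a + eta2 * b
        )
    return total


def predict(p, tau0, l_s, l_pr, eta1, eta2, Q, w0):
    """Price at tau0 + l_pr that keeps the invariant equal to its previous value."""
    lam = l_s - l_pr
    tau = tau0 + l_pr - l_s
    c_prev = invariant(p, tau - l_pr, lam, l_s, eta1, eta2, Q, w0)
    a1 = a2 = a3 = a4 = a5 = 0.0
    for h in range(lam + 1):
        w = weight(h, l_s, w0)
        a = 1.0 - (p[tau] / p[tau + h]) ** Q
        pq = p[tau + h] ** Q
        a1 += (1 - eta1) * (1 - eta2) * w * a
        a2 -= (1 - eta1) * (1 - eta2) * w * a * pq
        a3 += eta1 * (1 - eta2) * w * a
        a4 += eta2 * w
        a5 -= eta2 * w * pq
    denom = c_prev - a1 - a3 - a4
    if denom == 0:
        return None
    ratio = (a2 + a5) / denom
    return ratio ** (1.0 / Q) if ratio > 0 else None


# Illustrative prices and parameters only.
p = [1.300, 1.302, 1.301, 1.305, 1.304, 1.306, 1.309, 1.307, 1.310]
args = dict(l_s=4, eta1=0.1, eta2=0.5, Q=2.0, w0=0.3)
pred = predict(p, tau0=8, l_pr=1, **args)
```

The function returns `None` when the expression under the $1/Q$ root is not positive or the denominator vanishes. How such cases are handled in the original tests is not stated, so this guard is an assumption.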

The model was tested for various sets of the parameters $l_s$, $l_{pr}$, $\eta_1$, $\eta_2$, $Q$ and the new parameter $\epsilon$, which is defined as

$$\epsilon = \left| C(\tau, l_s - l_{pr}) - C(\tau - l_{pr}, l_s - l_{pr}) \right| \tag{24}$$

and describes the level of invariance in real data. The best prediction (the best meaning that the model has the best
ability to guess the right price) is obtained by using the following values of parameters:

$$l_s = 900, \quad \eta_1 = 0, \quad \eta_2 = 0, \quad Q = 6, \quad \epsilon = 10^{-10}. \tag{25}$$

The prediction behavior of the model with and without transaction costs on the EUR/USD
currency rate of the forex market is shown in Figs. 1-4. During the one-year period the model lost around 20% of the initial
money. It executed 1983 trades (Fig. 1), of which only 10 were suggested by the model (and earned money) and the rest
were random (which can be clearly seen in Figs. 3, 4). The problem of this model is its prediction length (the
parameter $l_{pr}$), which in this case is one tick ahead. The price was predicted correctly in 48.57% of all cases (16201 in
one year), and of these 48.57%, or 7869 cases, only 0.13%, or 10 cases, were suitable for trading. This
small percentage is caused by the fact that the price does not often change one tick ahead. One could try to
raise the prediction length to find more cases suitable for trading. This is only partly successful, because raising the
parameter $l_{pr}$ induces a loss of the prediction strength of the model. For example, when $l_{pr} = 2$ (two ticks ahead), the
prediction strength decreases from around 50% to 15%.



FIG. 2. The profit of the model on the EUR/USD currency rate, with transaction costs included, as a function of days for a
one-year period.
FIG. 3. The profit of the model on the EUR/USD currency rate, without transaction costs, as a function of the number of
trades for a one-year period.

FIG. 4. The profit of the model on the EUR/USD currency rate, without transaction costs, as a function of days for a
one-year period.
IV. PREDICTION MODEL BASED ON THE DEVIATIONS FROM THE CLOSED STRING/PATTERN FORM

For the next trading strategy we want to define some real values of the string sequences. Therefore we define
the momentum, which acquires values from the interval (0, 1). To study the deviations from the benchmark string
sequence, we define the momentum with some regular function $F_{CS}(h/l_s)$ as

$$M_{(l_s; Q)}(\tau + l_s) = \left(\frac{1}{l_s + 1} \sum_{h=0}^{l_s} \left| p_{stand}(h, \tau; l_s) - F_{CS}\!\left(\frac{h}{l_s}\right) \right|^{Q}\right)^{1/Q}, \tag{26}$$

where

$$p_{stand}(h, \tau; l_s) = \frac{p(\tau + h) - p_{min}(\tau; l_s)}{p_{max}(\tau; l_s) - p_{min}(\tau; l_s)}, \qquad p_{stand} \in (0, 1),$$

and

$$p_{max}(\tau; l_s) = \max_{h \in \{0, 1, 2, \ldots, l_s\}} p(\tau + h), \qquad p_{min}(\tau; l_s) = \min_{h \in \{0, 1, 2, \ldots, l_s\}} p(\tau + h).$$

The regular function from the definition above for the closed string has to fulfill the condition $F_{CS}(0) = F_{CS}(1) = 1$,
where $F_{CS} \in (0, 1)$.
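A minimal sketch of the momentum computation follows, under two loudly stated assumptions. First, the deviation from the benchmark is aggregated as a Q-norm averaged over the $l_s + 1$ ticks. Second, the benchmark `f_cs` is a raised cosine, chosen here only because it satisfies $F_{CS}(0) = F_{CS}(1) = 1$ and stays within the unit interval; the actual function used in the tests is not given in the text.

```python
import math


def f_cs(x):
    """Hypothetical closed-string benchmark with f_cs(0) = f_cs(1) = 1."""
    return 0.5 * (1.0 + math.cos(2.0 * math.pi * x))


def standardize(window):
    """Min-max standardisation of a price window onto the unit interval."""
    lo, hi = min(window), max(window)
    if hi == lo:
        raise ValueError("flat window: standardisation is undefined")
    return [(x - lo) / (hi - lo) for x in window]


def momentum(p, tau, l_s, Q, benchmark=f_cs):
    """Averaged Q-norm distance between the standardised window and the benchmark."""
    ps = standardize(p[tau:tau + l_s + 1])
    total = sum(abs(ps[h] - benchmark(h / l_s)) ** Q for h in range(l_s + 1))
    return (total / (l_s + 1)) ** (1.0 / Q)


# A window shaped exactly like the benchmark gives M close to zero ...
p_match = [1.0 + 0.01 * f_cs(h / 8) for h in range(9)]
m_match = momentum(p_match, tau=0, l_s=8, Q=2.0)

# ... while a monotone ramp deviates from it.
p_ramp = [1.0 + 0.001 * h for h in range(9)]
m_ramp = momentum(p_ramp, tau=0, l_s=8, Q=2.0)
```

Because both the standardised prices and the benchmark lie in the unit interval, the resulting value also stays between 0 and 1.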



A. Elementary trading strategy based on the probability density function of M 

The aim is to take advantage whenever the market conditions are favorable.
One can imagine the following set of instructions:

• The series of $(l_s + 1)$ price ticks $[p(\tau), p(\tau + 1), \ldots, p(\tau + l_s)]$ is transformed into a single representative real value $M(\tau + l_s)$;

• the nearly stationary series of $M(\tau + l_s)$ yields statistics which can be split into:

- a branch where M is linked with the future uptrend/downtrend;

- a branch where M is linked with the future profit/loss, taking into account transaction costs;

• accumulation of $\mathrm{pdf}(M^{+/-}_{long})$ (profit+ / loss-) and $\mathrm{pdf}(M^{+/-}_{short})$ (profit+ / loss-);

Arbitrage opportunity: taking advantage of the occurrence of a difference in the distributions.
The opportunity is measured by the Kullback-Leibler divergence

$$D_{KL} = \sum_{j\,(\mathrm{bins})} \mathrm{pdf}(M^{+}(j)) \, \log \frac{\mathrm{pdf}(M^{+}(j))}{\mathrm{pdf}(M^{-}(j))};$$

• a larger $D_{KL}$ means better opportunities ($D_{KL} > D_{threshold}$);

• statistical significance: the smaller the statistics accumulated into the bins $\mathrm{pdf}(M^{+}(j))$, $\mathrm{pdf}(M^{-}(j))$, the higher
the risk ($M$ from the selected range should be widespread).
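The comparison of the profit and loss distributions of M can be sketched as below. The binning of M on the unit interval, the smoothing constant `eps` that keeps the logarithm finite for empty bins, and the direction of the divergence (profit relative to loss) are all assumptions made for illustration. The sample values of M are made up.

```python
import math


def histogram(values, n_bins):
    """Normalised histogram of values lying in the unit interval."""
    counts = [0] * n_bins
    for v in values:
        j = min(int(v * n_bins), n_bins - 1)  # v == 1.0 falls into the last bin
        counts[j] += 1
    return [c / len(values) for c in counts]


def kl_divergence(pdf_plus, pdf_minus, eps=1e-12):
    """Kullback-Leibler divergence of pdf_plus from pdf_minus, summed over bins."""
    return sum(
        a * math.log((a + eps) / (b + eps))
        for a, b in zip(pdf_plus, pdf_minus)
        if a > 0
    )


# Hypothetical values of M recorded before profitable and losing trades.
m_profit = [0.10, 0.15, 0.20, 0.25, 0.80]
m_loss = [0.70, 0.75, 0.80, 0.85, 0.20]
pdf_plus = histogram(m_profit, n_bins=4)
pdf_minus = histogram(m_loss, n_bins=4)
d_kl = kl_divergence(pdf_plus, pdf_minus)
```

A trading rule in this spirit would act only when `d_kl` exceeds a chosen threshold and the bins involved contain enough samples.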

Again, as in the previous section, the model was tested for various sets of the free parameters $l_s$, $h$, $Q$ and the parameters
used in the function $F_{CS}$. This model can make "more-tick" predictions (in tests the prediction length varies from 100 to 5000 ticks).
Therefore it is much more successful than the previous model. It is able to make a final profit of around 160%, but this
huge profit precedes a drawdown of 140% of the initial state. It is important to emphasize that all profits mentioned
here and below are achieved by using leverage (borrowing money) of 1:10. The reason for the leverage is the fact
that the model could simultaneously open up to 10 positions (one position means one trade, i.e. one pair of buy-sell
transactions). If one decides not to use any leverage, the final profit decreases 10 times. On the other hand, with
a leverage of 1:20 the final profit doubles. Of course, the use of higher leverage is riskier, as drawdowns
are also higher. There is, for example, a drawdown of circa 6% around 600 trades in Fig. 5. With a leverage of 1:20
this drawdown rises to 12%.
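The leverage figures quoted above follow from a simple linear scaling, which can be written out as a one-line helper. The linear relation is the assumption implied by the text (profit and drawdown both scale with the leverage). The function name is hypothetical.

```python
def rescale_for_leverage(value_pct, base_leverage, new_leverage):
    """Scale a profit or drawdown (in percent) linearly with the leverage."""
    return value_pct * new_leverage / base_leverage


profit_no_leverage = rescale_for_leverage(160.0, base_leverage=10, new_leverage=1)
profit_1_to_20 = rescale_for_leverage(160.0, base_leverage=10, new_leverage=20)
drawdown_1_to_20 = rescale_for_leverage(6.0, base_leverage=10, new_leverage=20)
```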

128000 combinations of the model's parameters have been calculated. Figures 5-8 describe some interesting cases of the
prediction behavior of the model with transaction costs included on the EUR/USD currency rate of the forex market.
Figures 5, 6 describe the model (one set of parameters) with the requirement that the drawdown must not be higher
than 5%. The best profit achieved in this case is 12%.
FIG. 5. The profit of the model on the EUR/USD currency rate, with transaction costs included, as a function of the number of
trades for a one-year period.



In order to sort out the best combinations of parameters it is useful to use the statistical quantity called the Sharpe ratio.
The Sharpe ratio is a measure of the excess return per unit of risk in a trading strategy and is defined as

$$S = \frac{E(R - R_f)}{\sigma},$$

where $R$ is the asset return, $R_f$ is the return on a benchmark asset (risk free), $E(R - R_f)$ is the mean value of the
excess of the asset return over the benchmark return, and $\sigma$ is the standard deviation of the excess of the asset return.
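The definition translates directly into a few lines of Python. The sketch below uses the sample standard deviation (`statistics.stdev`) and a constant risk-free return; the text does not specify either choice, so both are assumptions. The sample returns are made up.

```python
import statistics


def sharpe_ratio(returns, risk_free=0.0):
    """Mean excess return divided by the sample standard deviation of the excess."""
    excess = [r - risk_free for r in returns]
    sd = statistics.stdev(excess)
    if sd == 0:
        raise ValueError("zero volatility: the Sharpe ratio is undefined")
    return statistics.mean(excess) / sd


# Hypothetical per-period returns of one parameter set.
s = sharpe_ratio([0.01, 0.03, 0.02, -0.01, 0.00])
```

Parameter sets can then be ranked by this value, with ties or near-ties broken by the total profit.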

Figure 7 shows the case where the Sharpe ratio has the highest value among all the calculated sets of parameters. The
one-year profit is around 26% and the maximum loss is slightly over 5%. Figure 8 describes the case where we require
a high value of the Sharpe ratio together with a profit of over 50%.

There exist sufficiently many cases with a high Sharpe ratio, which leads to an enhancement of the model into a
self-education model. This enhancement takes some ticks of data, finds the best set of parameters (high Sharpe
ratio and also high profit) and starts trading with these parameters for some period. Meanwhile, while trading with the
previously found parameters, the model is looking for a new best combination of parameters. Figure 9 describes this
self-education model, where the parameters are not chosen beforehand: the model itself finds the best ones from the financial data
and subsequently looks for the best values for the next trading strategy.
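The self-education loop described above resembles a walk-forward scheme, which can be sketched as follows. This is only an outline under stated assumptions: the selection criterion is the Sharpe ratio alone, the window lengths are fixed, and `toy_backtest` is a hypothetical stand-in for the string model, used here just to make the sketch runnable.

```python
import math
import statistics


def sharpe(returns):
    """Mean over sample standard deviation; -inf when it is undefined."""
    if len(returns) < 2:
        return float("-inf")
    sd = statistics.stdev(returns)
    return statistics.mean(returns) / sd if sd > 0 else float("-inf")


def walk_forward(prices, param_grid, backtest, train_len, trade_len):
    """Re-select the best parameters on each training window, then trade
    the following window with them. Returns the out-of-sample returns."""
    realised = []
    start = 0
    while start + train_len + trade_len <= len(prices):
        train = prices[start:start + train_len]
        trade = prices[start + train_len:start + train_len + trade_len]
        best = max(param_grid, key=lambda prm: sharpe(backtest(prm, train)))
        realised.extend(backtest(best, trade))
        start += trade_len
    return realised


def toy_backtest(k, segment):
    """Hypothetical stand-in strategy: follow the sign of the last k-tick move."""
    out = []
    for t in range(k, len(segment) - 1):
        move = segment[t] - segment[t - k]
        side = (move > 0) - (move < 0)
        out.append(side * (segment[t + 1] - segment[t]) / segment[t])
    return out


# Synthetic prices, for illustration only.
prices = [1.30 + 0.01 * math.sin(t / 3.0) + 0.0005 * t for t in range(60)]
oos = walk_forward(prices, [1, 2, 3], toy_backtest, train_len=20, trade_len=10)
```

Only the returns collected in the trading windows count towards the result, so the parameters are always evaluated on data they were not selected on.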

V. CONCLUSIONS 

The model of the strings allows one to manipulate the information stored along several extra dimensions. We
started from the theory of the 1-end-point and 2-end-point open strings and continued with partially compactified
strings that satisfy the Dirichlet and Neumann boundary conditions.

We have shown that the string theory may motivate the adoption of the nonlinear techniques of the data analysis 
with a minimum impact of justification parameters. The numerical study revealed interesting fundamental statistical 
properties of the maps from the data onto string-like objects. The main point here is that the string map gives 
a geometric interpretation of the information value of the data. The results led us to believe that our ideas and 
methodology can contribute to the solution of the problem of the robust portfolio selection. We believe that the 
method affords potential to be used in practical applications, where arbitrage selection bias should be taken into 
account. 

We established two different string prediction models to predict the behavior of the forex financial market. The first
model is based on the correlation function as an invariant, and the second one is an application based on the deviations
from the closed string/pattern form. The two approaches perform differently: the first model cannot
predict the behavior of the forex market with good efficiency, whereas the second one can and, moreover, is
able to make a relevant profit per year. From the results described we can conclude that the invariant model, as a
one-step price prediction, is not sufficient for the big dynamic changes of the current prices on the finance market. As can be seen
in Figs. 3, 4, when the transaction costs are switched off the model has some tendency to make a profit or at least
to preserve the initial capital. This means that it could also be useful, but for other kinds of data where the dynamics of changes
are slower, e.g. for energetic [10] or seismographic data [11] with longer periods of changes. The profit per year from
the second string model ranged from approximately 15% up to almost 100%, depending on the parameter set
chosen from the data. This model is established very well on the finance market and is useful for predicting future
prices for the trading strategy. Of course, we need to test the model much more, and with the flow of new financial
data the model will be optimized better. Finally, we can conclude that with these models string theory gains a
real sense and can be applied not only in high energy physics but also in the finance market. Maybe this is some
kind of proof that string theory is a good candidate for the foundation of matter itself. The presented models are
universal and could also be useful for predictions of other kinds of stochastic data. The self-educated models presented
in Fig. 9 are very useful because they are able to find the best parameter set from the data, learn about the prices and
utilize this information for the next trading strategy.

FIG. 6. The profit of the model on the EUR/USD currency rate, with transaction costs included, as a function of days for a
one-year period.

FIG. 7. The profit of the model on the EUR/USD currency rate, with transaction costs included, as a function of the number of
trades for a one-year period.



FIG. 8. The profit of the model on the EUR/USD currency rate, with transaction costs included, as a function of the number of
trades for a one-year period.
FIG. 9. The profit of the self-education model on the EUR/USD currency rate, with transaction costs included, as a function of
the number of trades for a one-year period.